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Abstract 

Extensions  of  the  ML  type  system,  based  on  con¬ 
strained  type  schemes,  have  been  proposed  for  lan¬ 
guages  with  overloading.  Type  inference  in  these  sys¬ 
tems  reguires  solving  the  following  satisfiability  prob¬ 
lem.  Given  a  set  of  type  assumptions  C  over  finite 
types  and  a  type  basis  A,  is  there  is  a  substitution  S 
that  satisfies  C  in  that  A  b  C'S  is  derivable?  Un¬ 
der  arbitrary  overloading,  the  problem  is  undecidable. 
Haskell  limits  overloading  to  a  form  similar  to  that 
proposed  by  Kaes  called  parametric  overloading.  We 
formally  characterize  parametric  overloading  m  terms 
of  a  regular  tree  language  and  prove  that  although  de¬ 
cidable,  satisfiability  is  NP-hard  when  overloading  is 
parametric. 


1  Introduction 

A  practical  limitation  of  the  ML  type  system  is  that 
it  prohibits  global  overloading  in  a  programming  lan¬ 
guage  by  restricting  to  at  most  one  the  number  of  as¬ 
sumptions  per  identifier  in  a  type  context,  a  limitation 
noted  by  Milner  himself  [Mil7 8] .  Suppose  we  wish  to 
assert  that  a  free  identifier,  say  +,  has  precisely  finite 
types  int  —>■  int  —>■  int  and  real  —>■  real  —>■  real.  Any 
context  in  which  +  has  one  of  the  two  desired  finite 
types  precludes  a  derivation  that  it  has  the  other.  On 
the  other  hand,  any  context  that  assigns  type  scheme 
Va.a  — >■  a  —>  a  to  +  is  one  from  which  too  many 
types  can  be  derived  for  +.  There  is  no  type  context 
in  system  ML  from  which  we  can  derive  all  and  only 
the  desired  finite  types  for  +.  Even  system  ML  with 
subtypes  is  inadequate.  From  type  context 

A  =  {int  C  real,  +  :  real  —>■  real  —>■  real} 

t  Appeared  in  Proc.  1994  lull  Conference  on  Computer  Lan¬ 
guages,  Toulouse,  France,  pp. 88-95,  16-19  May  1994. 


one  could  derive  A  b  +  :  int  —>■  int  —>■  real  but  not 
A  b  +  :  int  —>■  int  —>■  int. 

Several  type  disciplines  have  emerged  for  pro¬ 
gramming  languages  with  overloading.  Among  them 
are  those  based  on  intersection  types  [CoD78,  Sal78, 
CDV80]  and  those  based  on  constrained  type  schemes, 
the  latter  being  inspired  by  the  design  of  Haskell 
[WaB89,  CD091,  Smi91,  Kae92,  CH092,  Jon92],  The 
type  system  of  Forsythe,  an  explicitly-typed  descen¬ 
dant  of  Algol,  is  based  on  an  intersection  type  disci¬ 
pline,  namely  Aa  [Rey88].  Though  useful,  Aa  remains 
limited  in  that  it  has  no  type  schemes  and  all  inter¬ 
sections  are  finite  [Lie90,  Pie91]. 

A  more  flexible  type  discipline  for  languages  with 
overloading  is  an  extension  of  the  ML  type  system 
with  constrained  type  schemes  [Kae92,  Smi93].  Using 
the  notation  of  [Smi93],  a  constrained  type  scheme  has 
the  general  form 

Van,  .  .  . ,  exn  with  xx  :  iq,  .  .  . ,  xm  :  rm  .  r 

where  r  is  a  finite  type.  Finite  types  are  defined  in 
the  usual  way.  Every  type  variable  a  is  a  finite  type, 
and  if  Ti,  ...  ,rn  are  finite  types  then  so  are  T\  —>■  72 
and  x(ri;  •  •  • ,  Tn)  where  x  is  a  type  constructor  of  arity 
n.  The  x\  :  T\,  .  .  .  ,xm  :  rm  are  constraints  on  over¬ 
loaded  free  identifiers  x\,  .  .  . ,  xm.  Quantifier  V  is  omit¬ 
ted  if  there  are  no  quantified  variables  and  the  with 
clause  is  omitted  if  there  are  no  constraints,  in  which 
case  we  have  an  ordinary  ML  type  scheme.  Unlike  the 
ML  type  system,  a  free  identifier  may  be  overloaded, 
that  is,  have  multiple  assumptions  in  an  initial  type 
context,  so  we  refer  to  this  extension  as  system  ML0. 

The  fact  that  a  free  identifier  is  permitted  to  have 
more  than  one  assumption  in  a  type  context  immedi¬ 
ately  raises  the  issue  of  semantic  ambiguity  in  terms. 
Care  must  be  taken  to  ensure  that  terms  with  over¬ 
loaded  identifiers  have  unambiguous  meaning.  Con¬ 
sider,  for  instance,  type  context 

+  :  real  —>■  real  —>■  real, 

+  :  Va.set(a)  — >■  set  (a)  —>■  set  (a) 
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where  +  denotes  real  addition  and  set  union.  If  one 
can  derive  from  this  context  that  term  Xx.  x  +  x  has 
type  real  —>■  real  and  Mf3.set(fi)  — >■  set(fi)  then  the 
term  can  be  interpreted  in  one  of  two  different  ways. 
Its  meaning  then  must  be  determined  by  a  process 
called  overloading  resolution  whose  outcome  depends 
on  the  type  of  x.  Thus  we  say  that  the  overload¬ 
ing  of  +  above  is  incoherent.  Surprisingly,  incoherent 
overloading  is  pervasive  among  languages  despite  its 
potential  for  semantic  ambiguity.  For  example,  “/”  is 
often  overloaded,  as  in  Ada,  to  stand  for  integer  and 
floating-point  division. 

Coherent  overloading,  on  the  other  hand,  gives  rise 
to  discrete  polymorphism  where  the  meaning  of  a  term 
does  not  depend  on  overloading  resolution.  In  this  set¬ 
ting,  a  semantics  for  an  operator,  say  /,  is  postulated 
as  a  set  of  sentences,  or  axioms,  A  say  in  first-order 
logic.  A  model  of  A  is  any  interpretation  that  satis¬ 
fies  it.  So  if  for  a  type  basis  A,  one  is  able  to  derive 
A  b  /  :  a i  and  A  b  /  :  <72,  then  the  overloading  of  / 
within  A  is  coherent  if  a\  and  a 2  are  each  models  of  A. 
If  so,  then  we  may  regard  /  as  belonging  to  the  inter¬ 
section  of  a  1  and  (T2 .  Coherent  overloading  then  allows 
the  meaning  of  every  term  to  be  uniquely  determined 
by  simply  appealing  to  the  axioms  for  the  operators 
in  question  which,  after  all,  is  where  semantics  should 
be  prescribed. 

For  instance,  suppose  A  =  {V*.  x+x  =  x}.  Both  an 
interpretation  of  +  as  set  union  and  logical  disjunction 
satisfy  A,  so  sets  and  truth  values  are  models  of  A. 
But  A  is  false  under  a  real  number  interpretation.  So 
we  would  regard  the  sentence  in  A  as  an  axiom  of  set 
theory  and  boolean  algebra,  but  not  the  first-order 
theory  of  reals  with  addition.  The  overloading  of  + 
then  in  set  (1)  is  incoherent  if  we  adopt  the  sentence 
as  an  axiom  of  our  intended  meaning  of  +.  However 
if  the  first  assumption  for  +  in  (1)  is  replaced  by  +  : 
bool  —>■  bool  —>■  bool  then  the  overloading  is  coherent. 
So  although  we  may  be  able  to  derive  from  (1)  that 
Xx.  x+x  has  type  bool  —>■  bool  and  V/3.set(/3)  — >■  set(fi), 
we  know  the  function  belongs  to  the  intersection  of  the 
two  types  and  its  meaning,  given  uniquely  by  A,  is  a 
function  that  behaves  as  the  identity  function. 


(V-intro)  A  LI  C  \~  M  :  t'  , 

A  b  C[a  :=  r],  a  not  free  in  A 
A  b  M  :  Vd  with  C .  t' 

(V-elim)  A  b  M  :  Vd  with  C  .  t' , 

A  b  C\a  :=  r] 

A  \~  M  :  r'[ a  :=  f\ 

Figure  1:  Generalization  and  specialization  in  MLa 


respectively  to  type  generalization  and  instantiation 
in  system  ML  [Mil78,  DaM82]. 

The  antecedent  of  (V-intro)  requires  C  be  satisfi- 
able  with  respect  to  A.  That  is,  for  some  finite  types 
t,  A  b  C'[ a  :=  r]  must  be  derivable.  Operators  that 
are  constrained  in  C  and  interact  share  a  type  vari¬ 
able  which  in  essence  hypothesizes  a  model  common 
to  their  semantics.  Satisfiability  of  C  then  ensures  the 
existence  of  such  a  model  assuming  overloading  is  co¬ 
herent.  If  a  model  exists  (there  may  be  more  than 
one),  then  the  meaning  of  M  is  uniquely  determined 
by  the  axioms  of  the  operators,  otherwise  M  has  no 
meaning  and  consequently  should  be  and  is  untypable. 
For  example,  suppose 

J  +  :  bool  — >■  bool  — >■  bool, 

(  +  :  Mex.set(ex)  — >■  set  (a)  — >■  set  (a) 

is  a  coherent  overloading  with  respect  to  semantics 
Ai  =  {V*.  x  +  x  =  x}  and  suppose 

J  <  :  mt  — >■  mt  — >■  bool, 

(  <  :  Mex.set(ex)  — >■  set  (a)  — >■  bool 

is  a  coherent  overloading  relative  to  an  axiomatization, 
say  A2,  of  a  partial  order.  We  can  derive  from  (2)U(3) 
that  Xx.  (x  +  x)  <  x  has  type  Ma.set(a)  —>■  bool  since 
Ai  and  A2  have  sets  as  a  common  model.  So  the 
meaning  of  the  term  is  given  by  Ai  and  A2  and  is 
a  constant  function  mapping  sets  into  true.  If  there 
were  no  common  model  then  the  axioms  could  not  be 
applied  and  the  term  would  be  meaningless. 

So  rules  (V-intro)  and  (V-elim)  give  rise  to  the  fol¬ 
lowing  satisfiability  problem. 


1.1  Satisfiability 

Two  new  type  assignment  rules,  (V-intro)  and  (V- 
elim)  given  in  Figure  1,  accompany  constrained  types. 
For  a  constraint  set  C,  the  notation  A  b  C  means 
that  for  each  constraint  x  :  t  in  C ,  A  x  :  t  is  deriv¬ 
able.  The  notation  [a  :=  r]  denotes  a  substitution, 
the  application  of  which  is  written  in  postfix  form. 
Observe  that  when  C  is  empty  the  two  rules  reduce 


Definition  1.1  The  problem  of  constraint- set  satisfi¬ 
ability  CS-SAT  is  deciding  for  a  given  set  of  type  as¬ 
sumptions  C',  involving  only  finite  types  (constraints), 
and  an  assumption  set  A,  whether  there  is  a  substitu¬ 
tion  S  such  that  A  b  C'S  is  derivable. 

Without  any  restrictions  on  the  kind  of  overload¬ 
ing  in  A,  CS-SAT  is  undecidable  [Smi91].  Constrained 
type  schemes  permit  recursive  overloadings  where  an 


+  :  real  —>■  real  —>■  real 

+  :  Va  with  +  :  a  —>  a  —>  a  . 

matrix  (a)  —>■  matrix  (a)  —>■  matrix(a) 

*  :  int  —>■  vnt  —>■  int 

*  :  real  —>■  real  —>■  real 

*  :  Va  with  +  :  a  ^  a  ^  a,  *  :  a  ^  a  ^  a  . 

matrix  (a)  —>■  matrix  (a)  —>■  matrix  (a) 

Figure  2:  A  recursive  overloading 

assumption  for  an  overloaded  identifier  has  a  con¬ 
straint  whose  satisfiability  may  depend  on  the  as¬ 
sumption  itself.  This  permits  type  assumptions  to 
be  very  expressive.  For  example,  +  and  *  are  over¬ 
loaded  recursively  in  Figure  2  due  to  constraints  on 
+  and  *.  Eliminating  recursion  altogether  makes 
CS-SAT  decidable  but  this  is  unacceptable  because  it 
arises  naturally  in  practice  as  Figure  2  shows.  Smith 
gives  a  restriction  called  overloading  by  constructors 
that  allows  CS-SAT  to  be  solved  in  polynomial  time 
[Smi91].  But  it  prohibits  the  kind  of  recursion  given 
in  Figure  2.  The  functional  language  Haskell  adopts 
another  restriction  similar  to  that  proposed  by  Kaes 
called  parametric  overloading  [Kae88]. 

2  Parametric  Overloading 

Assumption  sets  that  arise  in  practice  often  fol¬ 
low  a  very  simple  pattern  of  overloading  called  para¬ 
metric  overloading  [Kae88].  This  form  of  overload¬ 
ing  allows  natural  recursive  overloadings  and  makes 
CS-SAT  decidable.  To  define  it,  we  introduce  the 
notion  of  the  least  common  generalization  ( LCG )  of 
a  set  of  finite  types  which  captures  common  struc¬ 
ture  among  type  assumptions  for  overloaded  identi¬ 
fiers  [Rey70,  McC84]. 

Definition  2.1  A  finite  type  r  is  a  common  general¬ 
ization  of  finite  types  ri,...,rn  if  there  are  n  substi¬ 
tutions  Si,  .  .  . ,  Sn  such  that  rSi  =  t;  for  all  i;  r  is  the 
least  common  generalization  of  these  types  if  in  addi¬ 
tion  there  is  a  substitution  S  such  that  t'S  =  r  for 
any  other  generalization  t' . 

It  is  useful  to  extend  this  definition  to  identifiers. 
If  identifier  x  is  overloaded  with  constrained  type 
schemes  V71  with  C\  .  Ti ,  .  .  . ,  Vyn  with  Cn  .  rn ,  such 
that  7”i ,  .  .  . ,  Tn  has  r  as  LCG  with  free  variables  d, 
then  Va.  r  is  the  LCG  of  x. 

For  example,  if  +  is  overloaded  with  assumptions 
+  :  int  —>■  real  —>■  real  and  +  :  real  —>■  complex  —>■ 
complex  then  its  LCG  is  Va,  /?.  a  — >  /3  — >  /3. 


f,  9  -  X 1 

f  :  Va  with  g  :  a  .  X2(a) 
g  :  Va  with  /  :  a  .  X3(a) 

Figure  3:  A  mutually-recursive  overloading 

Definition  2.2  Parametric  assumption  sets  are  de¬ 
fined  inductively. 

The  empty  set  is  parametric. 

If  A  is  parametric  with  no  assumption  for  x  and  a  is 
a  constrained  type  scheme  Va  with  C  .  r  such  that  for 
each  z  :  p  E  C ,  z  is  overloaded  in  A  and  p  is  a  generic 
instance  of  its  LCG  then  A  U  {x  :  a}  is  parametric. 

If  A  is  parametric  with  no  assumption  for  x  and  B 
is  the  set 

V71  with  Ci  .  r[a  :=  xi(7i)]  1 

< 

.  x  :  Vyn  with  Cn  .  r[a  :=  Xn(7n)]  ) 
such  that 

•  x  has  LCG  Va.  r, 

•  Xi  i1  Xj  for  i  ±  j,  and 

•  z  :  p  E  Ci  implies  that  z  has  LCG  V7T.  p,  for  some 
7T  G  Ji,  and  either  z  is  overloaded  in  A  or  z  =  x, 

then  A  U  B  is  parametric. 

Examples  of  parametric  assumption  sets  are  given 
below  and  in  Figure  2. 

=  :  int  — >■  int  — >■  bool  'l 

=  :  Va,  j3  with  =:  a  — >■  a  — >■  bool .  I 

pair  (a,  j3)  — >■  pair  (a,  j3)  — >■  bool  ( 

=  :  Va.  ref  (a)  — >■  ref  (a)  — >■  bool  J 

The  last  assumption  above  specifies  a  polymorphic  in¬ 
stance  for  =,  reflecting  that  equality  is  meaningful  for 
references  (pointers). 

Parametric  assumption  sets  allow  a  limited  form  of 
recursion.  If  we  define  a  dependency  relation  among 
identifiers  in  a  type  assumption  set  that  says  identi¬ 
fier  /  depends  on  g  if  and  only  if  /  has  an  assumption 
with  a  constraint  on  g,  then  we  see  that  parametricity 
ensures  that  the  transitive  closure  of  the  relation  is 
antisymmetric  and  consequently  mutual  recursion  is 
prohibited.  For  instance,  the  set  in  Figure  3  is  mutu¬ 
ally  recursive  and  therefore  is  not  parametric.  Neither 
the  assumptions  for  /  nor  g  can  be  introduced  because 
each  requires  the  introduction  of  the  other. 


2.1  Regular  Tree  Languages 

Problem  CS-SAT  has  two  inputs,  A  and  G.  In 
practice  A  usually  varies  little  if  at  all  across  differ¬ 
ent  instances  of  type  inference.  Thus  we  can  benefit 
from  suitably  representing  A  and  reusing  its  represen¬ 
tation  for  different  inputs  G.  A  realistic  measure  of 
CS-SAT ’s  complexity  should  not  ignore  this  fact.  So 
although  A  is  an  assumption  set,  we  assume  that  as 
an  instance  of  CS-SAT,  it  is  suitably  represented.  If  A 
is  parametric  then  every  overloaded  identifier  x  has  an 
LCG  of  the  form  Vac  r  and  the  set  of  finite  types  7r  to 
which  a  can  be  instantiated,  meaning  one  can  derive 
A  \~  x  :  r[a  :=  tt],  form  a  regular  tree  language. 

Given  an  alphabet  A,  an  A-valued  tree  t  is  spec¬ 
ified  by  its  set  of  nodes,  or  domain,  dom(t),  and  a 
valuation  of  the  nodes  in  A.  Formally,  a  k-ary,  A- 
valued  tree  is  a  mapping  t  :  dom(t)  —>■  A  where 
dom(t)  C  {0,  .  .  . ,  k  —  1}*  is  a  nonempty  set  and  closed 
under  prefixes.  The  frontier  of  t  is  the  set  of  nodes 
{w  G  dom(t)  |  -i 3i.wi  G  dom(t)}.  We  assume  that  A 
is  partitioned  into  a  ranked  alphabet  E,  and  a  frontier 
alphabet  X .  For  any  E  and  X ,  we  denote  the  set  of  all 
finite  EX-trees  by  Fs(X). 

Regular  tree  languages,  or  forests,  can  be  charac¬ 
terized  in  different  ways  using  tree  recognizers  (au¬ 
tomata)  [GeS84]  or  familiar  operations  of  regular  sets, 
like  concatenation  and  closure,  extended  to  finite  sets 
of  trees  [Tho90].  To  simplify  our  proofs,  we  choose 
to  characterize  them  as  forests  generated  by  a  class  of 
context-free  grammars  called  the  regular  tree  gram¬ 
mars  [GeS84]. 

Definition  2.3  A  regular  EX- grammar  G  consists  of 

•  a  finite  nonempty  set  N  of  nonterminal  symbols, 

•  a  finite  set  P  of  productions  A  —>■  r  where  A  G  X 
and  r  G  F^(N  U  X),  and 

•  an  initial  symbol  S  G  X. 

Definition  2.4  If  G  =  ( N,T,X,P,S )  is  a  regular 
EX -grammar  then  the  EX -forest  generated  by  G  is 
T(G)  =  {t  G  XS(X)  \S^*Gt}. 

From  a  given  parametric  assumption  set  A,  the 
idea  is  to  construct  for  each  overloaded  identifier  x 
a  regular  tree  grammar  Gx  such  that  if  x  has  LCG 
Va.  r  then  for  any  closed  (variable-free)  finite  type  tt, 
A  b  x  :  r[a  :=  tt]  is  derivable  if  and  only  if  7r  G  T(GX). 
So  determining  whether  constraint  x  :  r[a  :=  tt]  is  sat- 
ishable  with  respect  to  A  amounts  to  parsing  tt.  Gx 
always  has  a  nonempty  ranked  alphabet  of  type  con¬ 
structors  xi,  ■  ■  ■ ,  Xn  and  an  empty  frontier  alphabet. 


So  we  drop  the  frontier  alphabet  from  discussion  and 
speak  of  just  E-trees  from  now  on,  the  collection  of 
which  is  Fs  for  a  given  E. 

Critical  to  our  representation  of  a  parametric  over¬ 
loading  is  the  property  that  regular  forests  are  effec¬ 
tively  closed  under  intersection.  This  implies  they  are 
properly  contained  within  the  context-free  languages 
since  the  latter  are  not  closed  under  intersection. 

Theorem  2.1  If  G i  and  G 2  are  regular  tree  gram¬ 
mars  then  T(G\)  fl  T{Gr2)  is  generated  by  a  regular 
tree  grammar. 

Proof.  Suppose  Gi  =  (Xi ,  E,  Pi,  Si)  and  G2  = 
(X2,  E,  P2,  S2)  are  regular  E-grammars.  Let  E- 
grammar  G  =  (Xi  x  X2 ,  E ,  P,  [Si ,  S2])  where 

[A,B\^a([Y1,Z1\,...,[Yn,Zn])eP,  for  n>0 

if  and  only  if  A  — >■  a(Yi,...,Yn)  G  Pi,  B  — >■ 
a(Zi,...,Zn)  G  P2,  and  a  G  E.  Then  T(G)  = 
T(Gi)  fl  T(G2).  □ 

Suppose  x  is  overloaded  in  an  initial  parametric  as¬ 
sumption  set  A  with  LCG  Va.  r  and  that  E  contains 
all  type  constructors  of  A.  We  construct  Gx  as  follows. 
Since  the  overloading  for  x  may  be  recursive,  we  first 
factor  all  assumptions  on  x  into  two  sets,  one  contain¬ 
ing  its  assumptions  without  any  constraints  on  x  and 
the  other  having  its  assumptions  with  only  constraints 
on  x  if  any.  Gx  then  is  the  intersection  of  the  regular 
E-grammars  representing  the  two  sets.  These  two  tree 
grammars  cannot  depend  on  Gx  since  the  transitive 
closure  of  the  dependency  relation  is  antisymmetric. 

A  regular  E-grammar  is  constructed  for  each  set  as 
follows.  For  each  assumption 

*  :  V71,  .  .  .,7„  with  C  .  r[a  :=  ■  ■  ,1  n)] 

introduce  n  nonterminals  A\ ,  .  .  . ,  An  and  create  a  pro¬ 
duction  S  —>■  x(Ai,  .  .  . ,  An)  such  that  Ai  derives  ex¬ 
actly  nr=i T(GZk)  if  7 i  appears  in  constraints  on 
z\,...,zm  in  C  and  derives  F-%  otherwise.  By  The¬ 
orem  2.1,  the  intersection  can  be  described  by  a  regu¬ 
lar  E-grammar.  Nonterminal  S  is  the  start  symbol  of 
the  grammar.  The  finite  types  derivable  from  Ai  cor¬ 
respond  precisely  to  those  types  that  satisfy  all  con¬ 
straints  in  C  involving  7 ; . 

For  example,  we  construct  regular  E-grammars  G+ 
and  G*  for  the  parametric  assumption  set  in  Figure  2. 
Let  Eo  =  {mt,  real}  and  Ei  =  {matrix}.  Due  to  the 
constraint  on  +  needed  to  assert  that  *  may  stand 
for  matrix  multiplication,  construction  of  G*  depends 
on  G+.  So  we  begin  by  factoring  the  assumptions  for 


+  ,  leading  to  two  regular  tree  grammars  G i  and  G 2 
where  G 1  is 

Si  — >■  real  \  matrix  (U) 

U  —>■  mt  |  real  \  matrix  (U) 

and  G2  is 

—>■  real  \  matnx(S2) 

G\  arises  from  the  assumptions  for  +  with  the  lone 
constraint  on  +  deleted.  Therefore  U  derives  F^.  G 2 
on  the  other  hand  is  constructed  from  the  assumptions 
with  only  constraints  on  +  which  in  this  example  is 
the  same  as  the  original  set.  The  regular  E-grammar 
G+  for  T(Gi)  fl  T{G2)  becomes 

[Si ,  S2]  — >■  real  \  matrix ([IJ,  S2]) 

[U,  S2]  — >■  real  \  matrix ([IJ,  S2\) 

Next  we  construct  G*.  Corresponding  to  assump¬ 
tions  for  *  without  any  constraints  on  *  is  the  grammar 

S3 —>■  int  |  real  \  matrix([Si,  S2]) 

and  to  the  assumptions  with  only  constraints  on  *, 

S4  —>■  int  |  real  \  matrixes 4) 

G*  then  represents  their  intersection  and  is  given  by 

[53,54]— >  int  |  real  \  matnx([[Si,  S2],  S4]) 

[[Si,  S2],  S4]  — >■  real  \  matrix ([[U,  S2],  S4]) 

[[U,  S2],  S4] —>■  real  \  matrix ([[U,  S2],  S4]) 

Now  if  A  denotes  the  set  of  Figure  2,  then  for  any 
closed  finite  type  r,  j4b  +  :r^r^ris  derivable  if 
and  only  if  r  E  T(G+),  likewise  for  T(G*).  This  actu¬ 
ally  follows  from  the  next  theorem  which  establishes 
the  correctness  of  the  representation. 

Theorem  2.2  If  A  is  parametric  and  x  is  overloaded 
in  A  with  LCG  Vac  r  and  regular  T,-grammar  Gx  = 
( N ,  E,  P,  S )  then  A  h  x  :  r[a  :=  7r]  iff  it  E  T(GX). 

Proof.  We  use  a  normalized  version  of  ML0,  replacing 
(V-elim)  with  rule  (V-elim'): 

x  :  Vd  with  G .  r'  E  A,  A  h  C[d  :=  r] 

A  b  x  :  r'[d  :=  r] 

The  normalized  version  and  MLa  are  proved  equiva¬ 
lent  in  [Smi91].  We  prove  7r  E  T(GX)  implies  A  h  x  : 
r[a  :=  7r]  by  induction  on  the  structure  of  7r: 

(7 r  =  x).  If  x  G  T(GX)  then  S  — >■  \  C  P  which 


implies  x  :  r[a  :=  x]  G  A.  By  rule  (hypoth)  then 
A  b  x  :  r[a  :=  x\- 

(tt  =  x(r)).  If  x(t)  E  T(Gx)  then  S  x(t)  G 
P,  n  E  T(Gi),  where  G*  =  (N,  E,  P,  ji),  and  x  : 
V7  with  G  .  r[a  :=  x(t)]  G  A.  Suppose 
are  all  identihers  constrained  in  G  by  7 ; .  Since  T 
is  parametric,  Zk  '■  Pk  &  C  implies  zj,  has  LCG  V7 pk 
and  Zk  is  overloaded  in  A.  By  the  construction  of  Gx 
we  have  T(Gi)  =  fl™=i  T{GZ  J  so  r8-  E  T(GZk  )  for 
k  =  1,  .  .  . ,  m.  By  the  inductive  hypothesis,  A  h  zj,  : 
[7*  :=  r*']-  So  by  rule  (V-elim'),  A  \~  x  :  r[a  :=  x(V)]. 
Next  we  prove  that  A  \~  x  :  r[a  :=  tt]  implies  7r  E 
T(GX)  by  induction  on  the  length  of  the  derivation 
of  A  h  x  :  r[a  :=  7r].  The  derivation  ends  with  an 
application  of  rule  (hypoth)  or  rule  (V-elim'): 

(hypoth).  If  x  :  r[a  :=  x]  E  A  then  S'^x6-P  which 
implies  x  G  T(GX). 

(V-elim').  The  derivation  ends  with 

x  :  V7  with  G.r[a  :=x(t)]  C  a, 

A  b  C[j  :=  V] 

A  b  x  :  r[a  :=  x(V)] 

Suppose  zi,  .  .  . ,  zm  are  all  identihers  constrained  in  G 
by  ji.  Since  A  is  parametric,  zj,  :  pk  G  G  implies  zj, 
has  LCGMji.  pk  and  zj,  is  overloaded  in  T.  Then  T  b 
G[y  :=  7r]  implies  A  h  zj,  :  Pfc[7i  :=  7Tj]  so  by  the  induc¬ 
tive  hypothesis  717  E  T(GZk)  for  k  =  1,  .  .  . ,  m,  or  717  E 
P™=1  Now  *  :  V7  with  G  .  r[a  :=  x(V)]  G  A 

implies  S  —>■  x(V)  G  P •  By  virtue  of  the  con¬ 
struction  of  Gx,  we  have  717  E  T(Gi)  and  therefore 

x(v)  e  t(gx).  □ 

3  CS-SAT  is  NP-hard  for  Parametric 
Overloading 

The  NP  lower  bound  is  proved  by  factoring  a  re¬ 
duction  from  3CNF-SAT  through  the  problem  of  com¬ 
puting  the  intersection  of  a  sequence  of  regular  forests. 
Though  this  is  unnecessary  and  a  simpler  proof  is  pos¬ 
sible,  it  is  done  in  order  to  isolate  the  source  of  the 
hardness  which  lies  in  computing  this  intersection. 

Theorem  3.1  Given  a  parametric  assumption  set  A 
with  overloaded  identifiers  xi,...,x„  whose  assump¬ 
tions  are  represented  by  regular  tree  grammars  and  a 
constraint  set  C  over  x\,  .  .  . ,  xn  such  that  x  :  p  E  G 
implies  p  is  a  generic  instance  of  the  LCG  of  x  in  A, 
deciding  whether  C  is  satisfiable  under  A  is  NP-hard. 

Proof.  We  give  a  P-time  reduction  from  3CNF-SAT. 
Given  a  3CNF  formula  E,  consisting  of  clauses 


d\,  .  .  .  ,dn,  we  construct  a  parametric  assumption  set 
Ae,  with  all  overloadings  represented  by  regular  tree 
grammars,  and  a  constraint  set  C  such  that  C  is  sat- 
ishable  under  Ae  if  and  only  if  E  is  satishable. 

Suppose  E  has  m  distinct  variables  x\,  ... ,  xm  and 
let  the  ranked  terminal  alphabet  E  =  Eo  U  Si  where 
So  =  {e}  and  Si  =  {T,  F}.  Construct  a  regular  S- 
grammar  Gd ,  for  each  clause  d;  so  that  a  E  T(Gdt)  if 
and  only  if 

a  =  EhiBzi- ■  ■  Bm(e)  ■  ■  ■)) 

and  the  assignment  of  truth  values  B\ ,  .  .  . ,  Bm  to 
x\,  .  .  . ,  xm  respectively  satisfies  d{.  If  d;  contains  vari¬ 
ables  Xj,  xk,  and  xj,  with  j  <  k  <  l,  and  xj  <—  Bj, 
xk  <—  Bk,  and  xj  <—  Bj  is  a  truth  assignment  satisfy¬ 
ing  di,  then  construct  a  regular  E-grammar  with  start 
symbol  x\  and  productions 

Xj  Bj(xj+ 1)  xk  —>■  Bk(xk+1)  X]  Bi(xi+i) 

and  for  1  <  i  <  m  with  i  j,  i  k,  and  i  y^  /, 

Xi  —>T{xi+i)  |  F(xi+i) 

and  Hnally  xm+i  —>■  e.  There  is  one  such  regular  X- 
grammar  for  each  of  the  7  truth  assignments  satisfying 
di,  call  them  G 1,  ■  ■  ■ ,  G7.  Then  let 

7 

T(Gdi)={jT(Gk) 

k= 1 

Gd ,  can  be  constructed  in  O(m)  steps  so  that  for  each 
nonterminal  Y  and  truth  value  B,  there  is  at  most  one 
production  of  the  form  Y  —>■  B(Z).  For  1  <  i  <  n, 
add  to  Ae  assumption  X{  :  e  if  X  —>■  e  is  a  production 
of  Gd ,  and  assumption 

Yi  :  Va  with  Z{  :  a  .  B(a) 

if  Y  —>■  B(Z)  is  a  production  of  Gd,-  If  Gd ,  has  start 
symbol  Si,  then  with 

C  =  {Si  :  a,  .  .  . ,  Sn  :  a} 

E  is  satishable  if  and  only  if  n"=i  T{Gdi)  is  nonempty, 
or  if  and  only  if  C  is  satishable  under  Ae-  HU 

As  is  the  case  for  deciding  whether  a  sequence  of  fi¬ 
nite  automata  accept  a  common  string,  the  source  for 
the  hardness  of  CS-SAT  lies  not  in  deciding  emptiness 
but  rather  in  computing  the  intersection,  in  this  case, 
of  a  sequence  of  regular  forests  T(G\),  .  .  .  ,T(Gm). 
The  emptiness  of  T(G)  for  a  regular  tree  grammar 
G  is  decidable  in  time  0(\  G  |2)  in  the  usual  way. 


From  the  proof  of  Theorem  3.1  then  every  problem 
in  NP  is  P-time  Turing  reducible  to  the  problem  of 
constructing  the  intersection  of  a  sequence  of  regular 
tree  grammars,  so  the  construction  is  NP-hard.  This 
helps  to  explain  why  the  worst-case  time  complexity  of 
an  improved  algorithm  for  computing  the  intersection 
of  regular  forests  is  still  exponential  [AiM91].  Actu¬ 
ally  computing  the  intersection  is  much  harder.  A 
weak  PSPACE-hard  lower  bound  follows  immediately 
from  the  finite  automaton  intersection  problem,  treat¬ 
ing  strings  as  unary  trees.  A  tighter  exponential  time 
lower  bound  follows  from  the  complexity  of  the  in¬ 
tersection  problem  for  tree  automata  [FSV91].  For  a 
fixed  m,  it  can  be  computed  in  polynomial  time. 


4  Conclusion 

Some  might  argue  that  given  that  ML  typability 
is  complete  for  DEXPTIME  [KTU90],  the  fact  that 
CS-SAT  is  NP-hard  is  insignificant.  If  we  were  con¬ 
cerned  only  about  the  worst-case  time  complexities 
of  type  inference  algorithms  then  this  might  be  true. 
But  experience  has  shown  that  the  DEXPTIME  lower 
bound  is  not  an  issue  in  practice  and  type  inference 
algorithms  whose  worst-case  time  complexities  are  ex¬ 
ponential  perform  quite  well  on  pratical  programs.  In 
fact  it  was  folklore  for  many  years  that  ML  typability 
could  be  decided  in  polynomial  time.  So  the  complex¬ 
ity  of  CS-SAT  could  very  well  be  the  dominating  com¬ 
plexity  in  practice.  More  experience  is  needed  though 
in  using  systems  like  MLa  to  determine  whether  the 
NP  lower  bound  for  CS-SAT  is  a  practical  limitation. 
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